Script Identification in Indian Document Images based on Directional Morphological Filters

Author

  • Mallikarjun Hangarge
Abstract

This paper proposes a generalized framework for identifying different Indian scripts, based on the observation that every script has a distinct visual appearance. Directional morphological transformations are employed to extract directional linear features from text blocks. In total, 6460 text blocks of eleven Indian scripts at different scales were classified using K-nearest neighbor and Support Vector Machine (SVM) classifiers. The results were quite encouraging.
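
The idea of directional linear features can be illustrated with a minimal sketch. It is not the paper's implementation: the four directions, the line length of 3, the "fraction of surviving pixels" feature, and every function name below are assumptions made for illustration. The sketch applies a morphological opening with a line-shaped structuring element in each direction to a binary text block and records how much of the foreground survives.

```python
# Illustrative sketch only: directions, line length, and the feature
# definition are assumptions, not settings taken from the paper.

DIRECTIONS = {
    "horizontal": (0, 1),
    "vertical": (1, 0),
    "diagonal": (1, 1),        # top-left to bottom-right
    "anti_diagonal": (1, -1),  # top-right to bottom-left
}


def line_offsets(direction, length):
    """Pixel offsets of a straight line structuring element."""
    dr, dc = direction
    return [(i * dr, i * dc) for i in range(length)]


def erode(img, offsets):
    """Keep a pixel only if the whole line, placed there, fits in the foreground."""
    rows, cols = len(img), len(img[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            out[r][c] = int(all(
                0 <= r + dr < rows and 0 <= c + dc < cols and img[r + dr][c + dc]
                for dr, dc in offsets
            ))
    return out


def dilate(img, offsets):
    """Paint the line at every foreground pixel."""
    rows, cols = len(img), len(img[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if img[r][c]:
                for dr, dc in offsets:
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        out[rr][cc] = 1
    return out


def opening(img, offsets):
    """Erosion followed by dilation: keeps strokes that contain the line."""
    return dilate(erode(img, offsets), offsets)


def directional_features(img, length=3):
    """Fraction of foreground pixels that survive opening in each direction."""
    total = sum(map(sum, img))
    features = {}
    for name, direction in DIRECTIONS.items():
        opened = opening(img, line_offsets(direction, length))
        features[name] = sum(map(sum, opened)) / total if total else 0.0
    return features
```

For a block containing a single horizontal stroke, the horizontal feature is 1.0 and the other three are 0.0. A feature vector of this kind, computed per text block, is the sort of input a K-nearest neighbor or SVM classifier could then be trained on.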

Similar resources

Handwritten Script Identification from a Bi-Script Document at Line Level using Gabor Filters

In a country like India, where many scripts are in use, automatic identification of printed and handwritten script facilitates many important applications, including sorting of document images and searching online archives of document images. In this paper, a Gabor feature based approach is presented to identify different Indian scripts from handwritten document images. Eight popular In...

Offline Handwritten Script Identification in Document Images

Automatic handwritten script identification from document images facilitates many important applications, such as sorting, transcription of multilingual documents, and indexing of large collections of such images, or serves as a precursor to optical character recognition (OCR). In this paper, we investigate texture as a tool for determining the script of a handwritten document image, based on the observa...

Extracting Vessel Centerlines From Retinal Images Using Topographical Properties and Directional Filters

In this paper we consider the problem of blood vessel segmentation in retinal images. After enhancing the retinal image, we use the green channel for segmentation, as it provides better discrimination between vessels and background. We treat the negative of the green channel image as a topographical surface and extract ridge points on this surface. The points with this property are l...

Script Identification from Printed Document Images Using Statistical Features

Automatic identification of the script in a document image facilitates many important applications, such as automatic archiving of multilingual documents, searching online archives of document images, and selecting a script-specific OCR in a multilingual environment. In this work, a technique for script identification from document images is proposed. The method uses vertical and horizontal...

Script Identification in Printed Bilingual Documents

Identifying the script of the text in multi-script documents is an important step in designing an OCR system for page analysis and recognition. Much work has already been reported in this area for Roman, Arabic, Chinese, Korean and Japanese scripts. In the Indian context, though some results have been reported, the task is still in its infancy. In the work ...

Publication date: 2009